Convolutional Neural Network based Age Estimation from Facial Image and Depth Prediction from Single Image
نویسندگان
چکیده
Convolutional neural network (CNN), one of the most commonly used deep learning methods, has been applied to various computer vision and pattern recognition tasks, and has achieved state-of-the-art performance. Most recent research work on CNN focuses on the innovations of the structure. This thesis explores both the innovation of structure and final label encoding of CNN. To evaluate the performance of our proposed network structure and label encoding method, two computer vision tasks are conducted, namely age estimation from facial image and depth estimation from a single image. For age estimation from facial image, we propose a novel hierarchical aggregation based deep network to learn aging features from facial images and apply our encoding method to transfer the discrete aging labels into a possibility label, which enables the CNN to conduct a classification task rather than regression task. In contrast to traditional aging features, where identical filter is applied to the entire facial image, our deep aging feature can capture both local and global cues in aging. Under our formulation, convolutional neural network (CNN) is employed to extract region specific features at lower layers. Then, low layer features are hierarchically aggregated by using fully connected way to consecutive higher layers. The resultant aging feature is of dimensionality 110, which achieves both good discriminative ability and efficiency. Experimental results of age prediction on the MORPH-II and the FG-NET databases show that the proposed deep aging feature outperforms state-of-the-art aging features by a margin. Depth estimation from a single image is an essential component toward understanding the 3D geometry of a scene. Compared with depth estimation from stereo images, depth map estimation from a single image is an extremely challenging task. This thesis addresses this task by regression with deep features, combined with surface normal constrained depth refinement. The proposed framework consists of two steps. First, we implement a convolutional neural network (CNN) to learn the mapping from multi-scale image patches to depth on the super-pixel level. In this step, we apply the proposed label encoding method to transfer the contin-
منابع مشابه
A Deep Model for Super-resolution Enhancement from a Single Image
This study presents a method to reconstruct a high-resolution image using a deep convolution neural network. We propose a deep model, entitled Deep Block Super Resolution (DBSR), by fusing the output features of a deep convolutional network and a shallow convolutional network. In this way, our model benefits from high frequency and low frequency features extracted from deep and shallow networks...
متن کاملA Convolutional Neural Network based on Adaptive Pooling for Classification of Noisy Images
Convolutional neural network is one of the effective methods for classifying images that performs learning using convolutional, pooling and fully-connected layers. All kinds of noise disrupt the operation of this network. Noise images reduce classification accuracy and increase convolutional neural network training time. Noise is an unwanted signal that destroys the original signal. Noise chang...
متن کاملUnified Depth Prediction and Intrinsic Image Decomposition from a Single Image via Joint Convolutional Neural Fields
We present a method for jointly predicting a depth map and intrinsic images from single-image input. The two tasks are formulated in a synergistic manner through a joint conditional random field (CRF) that is solved using a novel convolutional neural network (CNN) architecture, called the joint convolutional neural field (JCNF) model. Tailored to our joint estimation problem, JCNF differs from ...
متن کاملA Radon-based Convolutional Neural Network for Medical Image Retrieval
Image classification and retrieval systems have gained more attention because of easier access to high-tech medical imaging. However, the lack of availability of large-scaled balanced labelled data in medicine is still a challenge. Simplicity, practicality, efficiency, and effectiveness are the main targets in medical domain. To achieve these goals, Radon transformation, which is a well-known t...
متن کاملLearning Document Image Features With SqueezeNet Convolutional Neural Network
The classification of various document images is considered an important step towards building a modern digital library or office automation system. Convolutional Neural Network (CNN) classifiers trained with backpropagation are considered to be the current state of the art model for this task. However, there are two major drawbacks for these classifiers: the huge computational power demand for...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2016